Weighted Principal Component Mllr for Speaker Adaptation

نویسندگان

  • Sam-Joo Doh
  • Richard M. Stern
چکیده

We present and describe two new speaker adaptation methods which apply principal component analysis to maximum likelihood linear regression (MLLR). If we apply MLLR after transforming the baseline mean vectors by their eigenvectors, the contributions of these eigenvalues to the variance of the estimates for the MLLR matrix are inversely proportional to their corresponding eigenvalues. In the first new technique, called Principal Component MLLR (PC-MLLR), we reduce estimator variance (at the expense of increased bias) by eliminating the contributions of principal components corresponding to smaller eigenvalues. The second technique, called Weighted Principal Component MLLR (WPC-MLLR) makes use of the contributions of all principal components, but weights them according to the inverse of their putative variance. In experiments using sentences from Spokes 0 and 3 from the 1994 DARPA Wall Street Journal evaluation, the use of WPC-MLLR provided a relative reduction in word error rates of 15.1% for non-native speakers and 6.0% for native speakers compared to conventional MLLR.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Eigenspace-based speaker adaptation methods in Persian speech recognition systems

Among speaker adaptation algorithms, eigenvoice (EV) and eigenspace-based MLLR (EMLLR) adaptation approaches have been proposed for rapid adaptation with very limited adaptation data. In these methods, a speaker adapted model is constrained to be a weighted combination of some orthogonal basis vectors. In this manner, both the number of parameters to be estimated from the adaptation data, and t...

متن کامل

Using class weighting in inter-class MLLR

A new adaptation method called inter-class MLLR has recently been introduced. Inter-class MLLR utilizes relationships among different transformation functions to achieve more reliable estimates of MLLR parameters across multiple classes, and it produces lower word error rates (WER) than conventional MLLR in circumstances where very little speaker-specific adaptation data are available. This pap...

متن کامل

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...

متن کامل

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...

متن کامل

Improving eigenspace-based MLLR adaptation by kernel PCA

Eigenspace-based MLLR (EMLLR) adaptation has been shown effective for fast speaker adaptation. It applies the basic idea of eigenvoice adaptation, and derives a small set of eigenmatrices using principal component analysis (PCA). The MLLR adaptation transformation of a new speaker is then a linear combination of the eigenmatrices. In this paper, we investigate the use of kernel PCA to find the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999